Patch Similarity Aware Data-Free Quantization for Vision Transformers
نویسندگان
چکیده
Vision transformers have recently gained great success on various computer vision tasks; nevertheless, their high model complexity makes it challenging to deploy resource-constrained devices. Quantization is an effective approach reduce complexity, and data-free quantization, which can address data privacy security concerns during deployment, has received widespread interest. Unfortunately, all existing methods, such as BN regularization, were designed for convolutional neural networks cannot be applied with significantly different architectures. In this paper, we propose PSAQ-ViT, a Patch Similarity Aware framework Transformers, enable the generation of "realistic" samples based transformer's unique properties calibrating quantization parameters. Specifically, analyze self-attention module's reveal general difference (patch similarity) in its processing Gaussian noise real images. The above insights guide us design relative value metric optimize approximate images, are then utilized calibrate Extensive experiments ablation studies conducted benchmarks validate effectiveness even outperform real-data-driven methods. Code available at: https://github.com/zkkli/PSAQ-ViT.
منابع مشابه
Distributed query-aware quantization for high-dimensional similarity searches
The concept of similarity is used as the basis for many data exploration and data mining tasks. Nearest Neighbor (NN) queries identify the most similar items, or in terms of distance the closest points to a query point. Similarity is traditionally characterized using a distance function between multi-dimensional feature vectors. However, when the data is high-dimensional, traditional distance f...
متن کاملClass-Aware Similarity Hashing for Data Classification
This paper introduces “class-aware similarity hashes” or “classprints,” which are an outgrowth of recent work on similarity hashing. The approach builds on the notion of context-based hashing to create a framework for identifying data types based on content and for building characteristic similarity hashes for individual data items that can be used for correlation. The principal benefits are th...
متن کاملInterval Similarity-Based Quantization Method for Continuous Data
Data quantization methods for continuous attributes play an extremely important role in artificial intelligence, data mining and machine learning because discrete values of attributes are required in most classification methods. In this paper, we present an interval similarity-based quantization method for continuous data. It defines an interval similarity criterion which is regarded as a new m...
متن کاملBohr: Similarity Aware Geo-distributed Data Analytics
We propose Bohr, a similarity aware geo-distributed data analytics system that minimizes query completion time. The key idea is to exploit similarity between data in different data centers (DCs), and transfer similar data from the bottleneck DC to other sites with more WAN bandwidth. Though these sites have more input data to process, these data are more similar and can be more efficiently aggr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-20083-0_10